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ORIGINAL ARTICLE 

Genomic copy number alterations in clear cell renal carcinoma: 
associations with case characteristics and mechanisms of VHL 
gene inactivation 

LE Moore 1 , E Jaeger 2 , ML Nickerson 3 , P Brennan 4 , S De Vries 2 , R Roy 2 , J Toro 1 , H Li 5 , S Karami 1 , P Lenz 6 , D Zaridze 7 , V Janout 8 , V Bencko 9 , 
M Navratilova 10 , N Szeszenia-Dabrowska 11 , D Mates 12 , WM Linehan 13 , M Merino 14 , J Simko 2 , R Pfeiffer 1 , P Boffetta 15 ' 16 , S Hewitt 17 , 
N Rothman 1 , W-H Chow 1 and FM Waldman 2 



Array comparative genomic hybridization was used to identify copy number alterations in clear cell renal cell carcinoma (ccRCC) 
patient tumors to identify associations with patient/clinical characteristics. Of 763 ccRCC patients, 412 (54%) provided frozen 
biopsies. Clones were analyzed for significant copy number differences, adjusting for multiple comparisons and covariates in 
multivariate analyses. Frequent alterations included losses on: 3p (92.2%), 14q (46.8%), 8p (38.1%), 4q (35.4%), 9p (32.3%), 9q 
(31.8%), 6q (30.8%), 3q (29.4%), lOq (25.7%), 13q (24.5%), 1p (23.5%) and gains on 5q (60.2%), 7q (39.6%), 7p (30.6%), 5p (26.5%), 
20q (25.5%), 12q (24.8%), 12p (22.8%). Stage and grade were associated with 1p, 9p, 9q, 13q and 14q loss and 12q gain. Males had 
more alterations compared with females, independent of stage and grade. Significant differences in the number/types of 
alterations were observed by family cancer history, age at diagnosis and smoking status. Von Hippel-Lindau {VHL) gene inactivation 
was associated with 3p loss (P<E-05), and these cases had fewer alterations than wild-type cases. The fragile site flanking the FHIT 
locus (3p14.2) represented a unique breakpoint among VHL hypermethylated cases, compared with wild-type cases and those with 
sequence changes. This is the first study of its size to investigate copy number alterations among cases with extensive patient, 
clinical/risk factor information. Patients characterized by VHL wild-type gene status (vs sequence alterations) and male (vs female) 
cases had more copy number alterations regardless of diagnostic stage and grade, which could relate to poor prognosis. 
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INTRODUCTION 

Renal cancer incidence is increasing 1 and Central/Eastern Europe 
has the highest rates worldwide. 2 Most (~90%) adult renal 
cancers are renal cell carcinomas (RCC), most of which are clear 
cell RCC (ccRCC). 3 ccRCC can be further classified by genetic 
features, most notably by the presence/absence of Von Hippel- 
Lindau (VHL) gene inactivation, which occurs through three 
mechanisms: sequence alteration affecting the transcript or 
protein, transcriptional silencing because of promoter CpG 
hypermethylation and loss of heterozygosity/copy number of 
the 3p25-26 locus. 4 

The VHL protein is involved in tissue-specific responses to 
oxygen concentration delivery. 5,6 VHL alteration through sequence 
changes and/or promoter hypermethylation, leading to 
transcriptional silencing, prevents formation of a protein 
complex required for hypoxia-inducible factor (HIF) degradation, 



resulting in excess HIF protein and a gene expression pattern 
facilitating adaption to oxygen deprivation, delivery and 
angiogenesis. VHL inactivation subtypes have been associated 
with improved prognosis and a favorable treatment response in 
some but not all studies. 7-11 In contrast, VHL wild-type cases may 
occur through mechanisms independent of VHL inactivation. 
Generally, VHL inactivation is considered an early event in ccRCC 
but other unidentified alterations may accompany VHL alteration 
at diagnosis. 

ccRCC subgroups have demonstrated different frequencies and 
patterns of VHL inactivation associated with known risk factors, 
which include obesity, hypertension, smoking, certain occupa- 
tional exposures and a positive family history of kidney cancer. 12,13 
High-intake of fruits and vegetables may reduce risk. 1 

In this study we used array comparative genomic hybridization 
to evaluate chromosomal alterations among incident ccRCC cases 
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enrolled in a large case-control study of renal cancer. Risk factors, 
tumor histopathological features and VHL gene inactivation were 
evaluated for association with the frequency and type of copy 
number alterations observed. 14-16 



RESULTS 

Cases with tumor biopsies {N = 41 2) were more likely to be 
residents from the Czech Republic, have a family cancer history and 
have a higher education level compared with those without biopsies 
(A/ =348) (Supplementary Table 1). Other factors did not differ. 

Fraction of genome altered (FGA) 

Males had a higher FGA compared with females (P = 0.002) 
(Figure 1). Stage and grade at diagnosis were associated with 
increasing fraction of the genome lost (FGL), fraction of genome 
gained (FGG) and total FGA (P< 0.0001). Ever smokers had 
elevated FGG (P = 0.05) and FGA (P = 0.05) compared with never 
smokers, but trends with current smoking status were not 
observed (P = 0.62). Individuals without a first degree relative 
with cancer had a higher FGA compared with those who did 
(27.9% vs 22.8%; P = 0.02). A lower FGA was observed in cases 
with VHL promoter hypermethylation (20.8%, P = 0.03) and 
sequence alteration (25.9%, P = 0.11), compared with wild-type 
cases (30.5%-referent). FGA was not associated with other factors. 

Chromosomal alterations 

The exact number and percent of all chromosome p and q arm 
gains and losses in all ccRCC cases in this study are shown in 
Supplementary Table 2. Chromosome arms altered in greater than 



25% of cases, and those of previous interest were included in 
multivariate models (Table 1). The strongest association was 
observed between 3p loss and VHL gene inactivation (P< 0.0001). 
3p loss was less frequent among VHL wild-type compared with 
inactivated cases (70.2% vs 95.3%, respectively). Therefore, 29.8% 
of VHL wild-type and 4.7% of inactivated cases had 3p present. No 
other factors with the exception of VHL inactivation were 
associated with 3p loss. Stage was strongly associated with 9p, 
9q and 13q (P< 0.0001), and to a lesser extent 1p, 10q and 14q 
losses. Stage was associated with 12p and 20q gains (P = 0.001). 
Grade was strongly associated with 8p and 14q (P = < 0.0001) and 
also 1p, 4q, 6q, 9p, 9q and 13q losses. Gains associated with grade 
were observed at 5p, 7p, 7q and 12q. 

After adjustment, male cases had more losses of 3q, 9p, 9q, 13q, 
14q and gains of 5p, 7q, 12p, 12q, 20q. Most of these alterations 
were associated with stage and/or grade. Patients 65 years or 
older had more 4q, 9p and 9q losses compared with younger 
patients diagnosed before age 50. Individuals that reported a 
family cancer history had fewer 7q gains and 8p losses, explaining 
the lower FGA observed. In contrast, there were no chromosome 
arm alterations associated with smoking status. Interestingly, 5q 
gain was frequently observed although it was not associated with 
any of the patient/clinical characteristics or risk factors evaluated. 
Closer examination revealed that gains occurred distally, specifi- 
cally at two regions (that is, 5q34-35 (55-58%), 5q31-32 (52-54%) 
and 5q21-23 (34-49%) (Supplementary Figure 1)). 

Loci 

Differences in copy number alteration prevalence at specific loci 
and risk factors were compared using a moderated f-statistic or F 
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Figure 1. Fraction (%) of the FGL, FGG and FGA among ccRCC case subgroups. P-values between subgroups were as follows: males vs females 
(P = 0.002), age (<50, vs ^50 years, P = 0.06), any family history of cancer (P = 0.02), stage (P<0.00001), grade (P<0.0001), ever vs never 
smoking (P = 0.05), VHL wild-type cases vs those with VHL promoter hypermethylation (P = 0.03), VHL wild-type cases vs those with VHL 
sequence alterations (P = 0.17). REF, referent group. 
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Table 1. Regression analysis of selected chromosomal arm alterations patient/clinical factors 



Chromosome arm 


/?/'s/c factors 


Odds ratio (95% confidence intervals) 


P-value 


1p- 


Grade: I, II (REF) vs III, IV 


2.4 (1.3-4.2) 


0.003 




Stage: T1 (REF), T2, T3-4 


1.6 (0.7-3.4); 3.7 (1.6-8.5) 


0.001 


3p- 


VHL gene inactivation: no (REF) vs yes 


1250 (476-3125) 


< 0.0001 


3q- 


BMI: <25 (REF), 25-30, 30 + 


2.5 (1.4-4.6); 1.9 (1.0-3.7) 


0.01 




Sex: female (REF) vs male 


2.4 (1.5-4.2) 


0.0004 


4q- 


Grade: I, II (REF), vs III, IV 


2.3 (1.4-3.6) 


0.001 




Age: <50 (REF), 50-60, 60 + 


1.6 (0.8-2.9); 3.0 (1.5-5.7) 


0.001 


5p + 


Grade: I, II (REF), vs III, IV 


1.6 (1.2-3.0) 


0.02 


6q- 


Grade: I, II (REF), vs III, IV 


2.1 (1.3-3.3) 


0.004 


7p + 


Grade: I, II (REF), vs III, IV 


2.1 (1.3-3.4) 


0.003 


7q + 


Family history cancer: no (REF) vs yes 


0.6 (0.4-0.9) 


0.02 




Grade: I, II (REF), vs III, IV 


2.1 (1.3-3.3) 


0.004 


8p- 


Grade: I, II (REF), vs III, IV 


3.0 (1.8-5.0) 


< 0.0001 




Family history cancer: no (REF) vs yes 


0.6 (0.4-0.9) 


0.01 


9p- 


Grade: I, II (REF), vs III, IV 


2.2 (1.3-3.7) 


0.003 




Stage T1 (REF), T2, T3-4 


3.3 (1.5-7.4); 6.3 (2.6-14.8) 


< 0.0001 




Age: <50 (REF), 50-60, 60 + 


1.9 (0.9-3.7); 3.1 (1.5-6.3) 


0.006 




Sex: female (REF) vs male 


2.0 (1.2-3.1) 


0.007 


9q- 


Grade: I, II (REF), vs III, IV 


2.5 (1.4-4.2) 


0.001 




Stage T1 (REF), T2, T3-4 


3.9 (1.6-9.2); 7.5 (3.0-18.6) 


< 0.0001 




Age: <50 (REF), 50-60, 60 + 


1.6 (0.8-3.2); 2.9 (1.4-5.9) 


0.007 




Sex: female (REF) vs male 


2.4 (1.5-4.0) 


0.0006 


12p + 


Stage T1 (REF), T2, T3-4 


2.8 (1.2-6.6); 5.0 (2.1-12.2) 


0.001 




Sex: female (REF) vs male 


2.1 (1.3-3.7) 


0.005 


12q + 


Stage T1 (REF), T2, T3-4 


1.2 (0.9-5.2); 4.8 (2.0-11.9) 


0.001 




Grade: I, II (REF), vs III, IV 


2.2 (1 .8-4.0) 


0.01 




Sex: female (REF) vs male 


2.3 (1.2-4.4) 


0.01 


13q- 


Stage T1 (REF), T2, T3-4 


4.4 (1.3-15.1); 13.3(3.8-46.6) 


< 0.0001 




Grade: I, II (REF), vs III, IV 


3.2 (1.9-5.6) 


< 0.0001 


14q- 


Organic solvents: no (REF) vs yes 


1.9 (1.1-3.2) 


0.03 




Sex: female (REF) vs male 


1.9 (1.1-3.1) 


0.03 




Stage T1 (REF), T2, T3-4 


1.5 (0.8-2.9); 3.1 (1.5-6.7) 


0.006 




Grade: I, II (REF), vs III, IV 


4.4 (2.34-8.2) 


< 0.0001 


20q + 


Stage T1 (REF), T2, T3-4 


1.3 (0.7-2.8); 3.1 (1.5-6.5) 


0.001 



Abbreviations: BMI, body mass index; VHL, Von Hippel-Lindau gene. Adjusted for Center, age, sex, stage, grade and VHL gene inactivation. 



statistic with pooled variance, controlling the false discovery rate 
(FDR). High frequency clonal alterations were sorted initially by 
FDR-adjusted P-values. Subsequently, significant loci were re- 
analyzed in adjusted models (Table 2). Grade was associated with 
loss of large regions on 4q, 9q, 13q, 14q, 18q and gains on 1q, 7p. 
Stage was associated with large losses on 9p, 9q, 13q, 14q, 17q, 
18q and gains on 3q and 7q. Both stage and grade were 
associated with loss of 1p36; 4q 11-12,26; 9p21-22; 9q21-21.3; 
14q13-14, 21, 24, 31-32; 18p1 1.2-12, 18q11-12, 21.3-23, tel; and 
gains on 3q22-23; 17q24-25; and 20q13. Both stage and grade 
were associated with losses on 14q23.2, a region harboring the 
HIF1A gene, and 14q13.2 harboring the PHD1-EGLN3 gene. We 
examined HIF1A protein expression in tumor tissue microarrays 
using immunohistochemistry, in relation to 14q and regional 
(clonal) loss across the HIF1A region (14q23). Although univariate 
analyses suggested a decrease in expression with 14q and 14q23 
loss, neither factor was associated with HIF1A expression after 
adjustment for stage and grade. Stage and particularly grade 
remained strongly associated with HIF1 A expression, independent 
of 14q loss (data not shown). 

Male cases exhibited losses on 6p22, 6p24-25, large regions of 
9q and gains at 1q43-44, 12p12.3, 12q12.4, 24-tel, compared with 
females (Table 3). These alterations were also associated with 
stage and grade. Among cases reporting a positive family cancer 
history, fewer cases demonstrated losses at 4q21-23 (11% vs 24%, 
P = 0.002) and gains at 20q13.2 (13% vs 26%, P = 0.002). Former 
and current smokers exhibited more loss at 3q12 (24% vs 40%, 
P = 0.04) and gain at 16p1 1.2-13 (22% vs 11%, P = 0.003). 
Interestingly, a significant positive trend was observed with 
current smoking status and gain at 16p1 1.2-13.2 (P-trend = 0.003). 



Lastly, copy number alteration of specific clones and VHL 
inactivation status were evaluated. The lowest FDR-adjusted 
P-values that remained significant in multivariate analyses were 
losses observed at clones RP1 1-245 E5 (3p24; P< 0.0001) and RP1 1- 
180G14 (3p14.3; P< 0.0001). RP11-180G14 (3p14.3) is located just 
distal to CTD-2175D15 (3p14.2), a clone harboring the FHIT locus 
(Figure 2). 

To follow-up upon these observations, the prevalence of all 3p 
losses was compared between groups stratified by VHL inactiva- 
tion characteristics. Clonal alteration prevalence among wild-type 
cases were compared separately to cases demonstrating inacti- 
vated through VHL promoter hypermethylation or sequence 
alteration. Loss of these 3p clones was significantly greater among 
cases with VHL sequence alterations compared with wild-type 
cases. Losses were significantly more prevalent at the 3p telomere 
(93% vs 59%, P<0.00001), and clone CTB110-J24 (3p25-26;V HL) 
(95% vs 62%, P< 0.00001) (Region A-Figure 2), and at 3p14.2 (FHIT) 
(92% vs 75%, P = 0.001) but not at regions proximal to the 
centromere. Loss of 3p loci among VHL hypermethylated cases 
was similar in prevalence to cases with sequence alterations but 
significantly higher compared with wild-type cases at the 3p 
telomere (83%, P = 0.03) and the VHL locus (91%, P = 0.0006) 
(Region A-Figure 2), but not significantly different to wild-type 
cases at loci centromeric to clone RP11-180G14 (3p14.3; Region 
B-Figure 2). Loss among hypermethylated cases was significantly 
less prevalent than among cases with sequence alterations at 
3p14.2 (84%, P=0.04) and regions proximal to the centromere 
(50%, P = 0.0002). These findings demonstrate differences in 3p 
loci loss between VHL inactivation subgroups, in particular clones 
surrounding the fragile FHIT locus. Lastly, VHL wild-type cases were 



© 2012 Macmillan Publishers Limited 



Oncogenesis (2012), 1 -9 



Genomic copy number alterations in ccRCC 
LE Moore et al 



Table 2. Regional clone and copy number alterations and ccRCC tumor stage and grade 


Chromosome 


Clone with 


Regions altered 


btart (Mo) 


tna 


P-value a 


brade I, ll (%; 


UR 


95% LI 


P-value 


arm 


highest signal 










vs III, IV (%) 






adf 


Grade 




















1p- 


RP11-60J11 


1p36 


7041.726 


7 232439 


0.0041 


14/33 


2.94 


(1.66-5.21) 


0.00001 




(1p36.2) 


















1q + 


RP11-104A2 


1q25, 31-41, 45 


206402.345 


2.07E + 08 


1.58E-05 


9/31 


4.42 


(2.34-8.36) 


0.00001 




(1q32-41) 


















3q + 


RP11-219P10 


3q22-23 


137 606.200 


1.38E + 10 


0.0037 


12/25 


1.96 


(1.07-3.61) 


0.03 


4p- 


RP1 1-1 7614 


4p14-16 


33 086.961 


33 087 410 


5.85E-05 


14/33 


3.10 


(1.73-5.57) 


0.0001 




(4p15.1) 


















4q- 


RP11-6K22 


4q 11-13, 24,26- 


120282.900 


1.20E + 08 


7.79E-06 


13/32 


3.43 


(1.88-6.26) 


0.0001 




(4q 26-27) 


31,34,35 
















6q- 


RP-2202A14 


6q14 


79297.260 


79474 213 


0.0034 


19/37 


2.09 


(1.22-3.56) 


0.007 




(6q14) 


















7p + 


RP11-161C7 


7p1 3-1 4,21 -22.3 


6215.911 


6 396692 


0.0002 


2543 


2.27 


(1.36-3.79) 


0.002 




(7p22.3) 


















7q + 


RP11-10D8 


7q22.1-.3 


98067.790 


98216 726 


3.78E-05 


28/46 


2.17 


(1.31-3.60) 


0.003 




(7q22.1) 


















Q n 
°P 


DD1 1 1 Qlfll 
Wr 1 1 - 1 OZoz 


opZZ-Z3 


44D / .OOZ 


4 D 1 4 1 jU 






Z.Dy 


\ 1 .DO— 4.Z/J 


U.UUU I 




(8p22) 


















9p- 


RP11-85J5 


9p21-22 


24524.208 


24524427 


1.97E-05 


26/49 


2.32 


(1.38-3.92) 


0.002 




(9p21.3) 


















9q- 


RP11 -106017 


9q21-33 


94882.444 


94882 881 


8.72E-06 


25/47 


2.19 


(1.30-3.68) 


0.003 




(9q22.3) 


















11q- 


RP11-42L18 


11q22-23 


107422.086 


1.08E + 08 


2.32E-05 


8/25 


3.81 


(1.93-7.53) 


0.0001 




( 1 1 QZZ.3-Z3) 


















12q + 


RP11-89J2 


12q13-14, 24.32 


124666.600 


1.25E + 08 


0.0005 


17/29 


1.81 


(1.26-3.68) 


0.04 




(12q24.32) 


















13q- 


RP11-11C5 


13q12-14.3, 21-tel 72013.886 


72183 774 


2.34E-06 


10/41 


5.36 


(2.94-9.77) 


0.00001 




(13q21.3) 


















14q- 


RP11-26M6 


14q12-14, 21.3- 


34404.822 


34591 592 


2.34E-06 


34/67 


3.18 


(1.91-5.28) 


0.00001 




(14q13) 


32.2 
















17q + 


RP11-84E24 


1 7q24-25, tel 


67 512.626 


67 693 011 


3.78E-05 


5/22 


4.50 


(2.16-9.39) 


0.00001 




(17q24-25) 


















1 Rn 
I op — 


v_ 1 U ZU4or 1 / 


18p1 1.21-1 1.32,12 8396.160 


o jyo ouj 


j. D 1 L"UD 


I H/ JO 


O.ZH 


( 1 OA ^ 


n nnnm 

U.UUUU 1 




(18p12) 


















18q- 


RP11-60P1 


18q 11-12, 21.3-23, 59072.800 


59251 211 


3.44E-06 


16/44 


3.80 


(2.20-6.55) 


0.00001 




\ i oqz 1 .5) 


tel 
















zUq + 


L I UZZ5or I o 


20q12,13 


45 712.070 


45 864616 


0.0001 


17/57 

1 115/ 


z.zo 


{ 1 .3Z-3.y4J 


U.UUo 




(20q12) 


















Chromosome 


Clone with 


Regions 


Start (Mb) 


End 


P-value a 


Stage 1, II (%) 


OR b 


95% CI 


P-value 


arm 


highest signal 


altered 








vs III, IV (%) 






adj° 


Stage 




















1p- 


RP11-219F4 


1 p36.2-36.3 


14 774.025 14 774291 0.002 


17/29 


2.16 


(1 .26-3.72) 


0.005 




(36.2-36.3) 


















2p + 


RP11-57F9 


2p23 


28 514.930 28 515 223 0.0056 


13/23 


1.89 


(1.06-3.38) 


0.03 


2q + 


CTD- 


2q22-24.3 


158357.236 158357 713 0.0034 


9/20 


2.94 


(1.52-5.71) 


0.001 




22500B20 




















(2q22) 


















oq + 


DDI 1 1 AHA 1 3 
r\r III OUn I o 


22-24, 26-29, 


144154.808 144312 333 0.0015 


1 Zl ZD 


1 74 


(ft Q7 O 1 1 \ 


n n^ 




(3q23-24) 


tel 
















4p- 


RP11-126C7 


11-12,13 


38087.290 38 242.92 0.0066 


12/26 


2.24 


(1.32-3.41) 


0.03 




(4p I .5 J 


















4q- 


RP11-101N17 


4q11-12, 26 


120563.175 120 720 926 0.0003 


14/29 


2.42 


(1.35-4.43) 


0.003 




(4q26) 


















6p- 


RP11-284E1 


6p12, 24-25 


53 823.776 53 824135 0.0003 


14/30 


2.48 


(1 .37-4.42) 


0.001 




(6p12) 


















7q + 


RP11-38P11 


7q26.2-.3, 


129557.600 129557 976 0.0018 


26/41 


1.63 


(1 .02-2.63) 


0.04 




(7q32) 


32,34,36.3 
















9p- 


RP11-125K10 


9p21-24, tel 


4819.655 4991 841 8.63E-0.05 


28/49 


2.17 


(1.33-3.53) 


0.002 




(9p24) 


















9q- 


RP11-8D10 


9q 12-21.3 


82 1 20.447 82 209 078 8.63E-0.05 


28/46 


2.28 


(1.41-3.70) 


0.001 




(9q21.3) 
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Table 2 (Continued) 



Chromosome 


Clone with 


Regions 


Start (Mb) 


End 


P-value a 


Stage 1, II (%) 


OR b 


95% CI 


P-value 


arm 


highest signal 


altered 








vs III, IV (%) 






adf 


12p + 


RP11-78F16 
(12p1 1.2) 


12p1 1.2,1 2.1- 
13.1 


31 959.002 


32 128568 


0.0005 


20/32 


2.04 


(1 .20-3.45) 


0.008 


12q + 


RP11-89E9 
(12q23) 


12q23.3 


97 222.144 


97 411 481 


0.0005 


18/31 


2.14 


(1 .26-3.64) 


0.005 


13q- 


RP11-8C15 
(13q12.1) 


13q12.1J2.3- 
1 4.3,2 1.1-tel 


20 985.475 


20 993 099 


8.63E-0.05 


15/38 


3.72 


(2.11-6.57) 


< 0.00001 


14q- 


RP11-206J4 
(14q23) 


14q 13-24, 31- 
32, tel 


63 996.230 


64080508 


0.0003 


38/62 


2.36 


(1.48-3.75) 


< 0.00001 


16p — 


CTD-2032H1 


16p13 


3232.028 


3 296628 


0.0104 


14/28 


2.01 


(1.28-3.33) 


0.01 


17q + 


RP11-86L18 


17q12- 
25.3,36 


153 645.400 


153 645 775 


0.0003 


27/43 


1.63 


(1.02-2.60) 


0.04 


18p- 


RP11-106J7 
(18p1 1.21- 
.23) 


18p1 1.2-12 


3268.728 


3 443 000 


0.0002 


13/35 


3.15 


(1.81-5.48) 


< 0.00001 


18q- 


RP11-12J12 
(18q22) 


18q11-12, 21- 
23, tel 


55 424.497 


55 607 583 


3.14E-0.06 


27/43 


2.84 


(1.68-4.88) 


< 0.00001 


20q + 


LLNL-255K9 


20q13.2 


55 338.240 


55 352455 


0.0005 


15/34 


2.24 


(1.31-3.82) 


0.003 



Abbreviations: ccRCC, clear cell renal cell carcinoma; CI, confidence interval; OR, odds Ratio. a P-value adjusted for multiple comparisons. b OR, 95% CI and P- 
values from multivariate models(MV) adjusted for age, sex, country stage, grade, smoking status and family history of cancer. 



Table 3. 


Regional clone and copy number alterations and patient/tumor characteristics 










Sex 


Clone with 


Region 


Start (Mb) 


End 


P-value 


Male (%)/ 


OR 


95% C/ b 


P-value b 




highest signal 


altered 






adf 


female (%) 


(MV) b ' c 






1q + 


RP11-188A4 


1q43-44 


239 316.500 


2.39E + 08 


0.002 


17/9 


2.56 


(1.43-4.55) 


0.04 




(1q43-44) 


















6p- 


RP11-52C20 


6p22, 24-25 


23 943.950 


24112567 


0.01 


22/11 


2.22 


(1.18-4.17) 


0.003 




(6p22) 


















9q- 


RP11-19014 


9q13, 21.1-22, 


70455.470 


70634947 


0.003 


38/21 


2.13 


(1 .30-3.57) 


0.001 




(9q13) 


33 
















12p + 


RP11-174L5 


12p12.3 


14 735.670 


14 736047 


0.001 


38/19 


2.44 


(1.45-4.00) 


0.001 




(12p12.3) 


















12q + 


RP11-29G23 


12q12.4, 24- 


43 908.780 


43 909199 


0.02 


25/14 


1.92 


(1.11-3.33) 


0.004 




(12q12.4) 


tel 
















Family history 










No (%)/yes (%) 








4q- 


RP11-7G22 


4q21-23 


84 523.610 


84 723458 


0.002 


24/11 


2.70 


(1.45-5.26) 


0.002 




(4q21.3-22) 


















20q + 


RP11-15M15 


20q13.2 


51 109.650 


51 266870 


0.003 


26/13 


2.38 


(1.41-4.55) 


0.002 




(20q13.2) 


















Ever smoker 










Never (%)/ever (%) 






16p + 


RP11-150K5 


16p1 1.2-13.2 


29825.190 


29 825 404 


0.003 


11/22 


2.72 


(1.42-5.21) 


0.003 




(16p11.2) 


















3q- 


CTD-2014B13 


3q12-14 


100 971.500 


1.01E + 08 


0.03 


24/40 


1.73 


(1.06-2.83) 


0.04 




(3q13.2) 


















Von Hippel-Lindau gene alteration 








Wild-type (%)/altered (%) 






3p- 


RP11-180G14 


3p14.3 


59 538.560 


59 538883 


0.02 


71/92 


4.88 


(2.23-10.68) 


< 0.00001 




(3p14.3) 




















RP11-245E5 


3p21-25, tel 


22 747.910 


22 936416 


1.97E-07 


63/95 


13.48 


(6.01-30.20) 


< 0.00001 




(3p24) 


















20q + 


LLNL-255K9 


20q 11-13.2 


55 338.240 


55 352455 


0.03 


33/20 


2.08 


(1.04-4.17) 


0.03 




(20q13.2) 


















Abbreviations: CI, confidence interval; MV, multivariate model; OR, Odds Ratio. a P-value adjusted for multiple comparisons. b OR, 95% CI and P-values from MV 


adjusted for age, sex, country stage, grade, smoking status and family history of cancer. c ORs presented for most prevalent comparison group (underlined). 



distinguished by having more gains at clone LLNL-255K9 (20q11- 
12) compared with inactivated cases (33% vs 20%, P = 0.03) 
(Table 3), a clone that was also more prevalent in cases with 
higher stage. 



Amplifications 

High-level amplifications were observed at 154 clones among 24 
cases (1-28 amplifications/case). The most frequent amplifications 
were located at clones RP1 1-28111 (1 1 pi 5.2-1 5.4; N = 6), RP11- 
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Figure 2. Frequency of copy number loss of 3p clones among ccRCC cases grouped by absence (wild-type) or presence of VHL gene 
inactivation (through sequence alteration or promoter hypermethylation). Region A (telomere to RP1 1-180G14): the frequency of copy 
number loss of individual 3p clones differed significantly between VHL wild-type and inactivated cases that occurred through both sequence 
alteration (P<0.00001) and promoter hypermethylation (P = 0.03-0.001). Region B (from RP1 1-180G14to RP1 1-154H23): the frequency of 
clonal copy number loss differed significantly between wild-type cases and those with sequence alterations (P = 0.003- < 0.00001), but was no 
longer observed with hypermethylated cases (P = 0.08-0.59). 



10M23, RP11-91H14, RP11-121G20 (6p21 .1-21 .2; N = 5) and CTB- 
136014 (12q14; N = 3). Two cases (one VHL wild-type with 3p 
present; one VHL inactivated with 3p loss) demonstrated 2q/6p co- 
amplifications. The 2q overlapping region (~ 179.4-1 99.8 Mb) 
covers ~100 genes; the 6p overlapping region (~ 39.1 -45.7 Mb) 
included the VEGF (vascular endothelial growth factor) gene. 



DISCUSSION 

Previous studies have utilized copy number profiling to classify 
histologically categorized RCC cases into molecularly defined 
subtypes (Table 4). 17-22 Consistent with previous studies, the most 
common alterations were loss of 3p, 14q and gain of 5q. 17-22 Two 
previous comparative genomic hybridization studies examined 
associations between RCC copy number variation with stage and 
grade. 18,20 One study (51 cases) reported two clusters defined 
solely by the alteration subtypes observed. 18 In both clusters, 3p 
loss and 5q gain were frequently observed, however, one cluster, 
defined by losses on 1 p, 4, 9, 1 3q and 1 4q, had a higher stage and 
grade, increased vascular involvement and lower overall survival 
that were independent of stage and grade. 18 Another study (51 
cases) reported a difference in genomic copy number and 
expression profiles between cases with/without biallelic VHL 
loss. 18 A study of 80 cases observed significant associations with 
stage and loss of 14q and 18p. 20 In the current study, both stage 
and grade were associated with losses on 14q harboring HIF1A 
(14q23.2) and PHD1-EGLN genes (14q13.2). Recently, genetic and 
functional studies reported that HIF1A is a target of 14q loss and 
that HIF1A activity is diminished in 14q deleted RCC cases. 21 Loss 
of these loci being associated with 14q loss was not corroborated 
in the current study, perhaps because of differences in type of 
study design and laboratory methodologies. In the current study, 
loss of specific loci spanning 14q23 was associated with stage and 
grade, which is consistent with the poor prognosis reported in 14q 
deleted cases. 18 

This is the first study to evaluate 3p loci loss among large 
numbers of cases that were also defined by VHL status. In 



multivariate analyses, only VHL gene alteration was associated 
with 3p loss. Cases with VHL promoter hypermethylation were 
similar to those with sequence alterations at loci distal to the FHIT 
locus, and to wild-type cases at loci centromeric to FHIT. This 
observation suggests that in hypermethylated cases, the FHIT 
breakpoint could be critical in terms of somatic copy number loss 
of the chromosome segment distal to 3p14.2. In contrast, the FHIT 
fragile site was not a significant breakpoint in VHL wild-type or 
cases with sequence alterations. The FHIT region spans the most 
common fragile site of the human genome (the FRA3B fragile site) 
encompassing the previously observed familial kidney cancer 
breakpoint (t(3;8) p14.2;q24) and a viral integration site. 23-25 
Hemizygous, interstitial or terminal 3p deletions involving the FHIT 
gene and reduced protein expression have been observed in over 
90% of ccRCC cases, similar to that observed in the current study 
(89%). 

In addition to heterogeneity across 3p by VHL inactivation 
subgroups, we found that wild-type tumors had more alterations 
compared with those with VHL inactivation through promoter 
hypermethylation; corroborating a previous report that wild-type 
cases may be less genetically stable than hypermethylated cases, 
and possibly those with sequence alterations 20 The higher 
genomic instability among wild-type cases suggests a greater 
potential for progression, as copy number alterations have been 
associated with tumor stage, grade and worse prognosis. 

Analyses of patient characteristics revealed that male cases had 
more alterations compared with females. The most prevalent copy 
number alterations in males occurred independently of stage and 
grade. This could indicate that incident male ccRCC tumors could 
be more aggressive compared with female cases, regardless of 
stage or grade at diagnosis. Male patients might also benefit from 
more aggressive treatment, follow-up and their genetic alterations 
could provide clues for targeted therapies. Several target genes on 
chromosome 9 include CDKN2A and CD/CA/28. 19,20 Other risk 
factors examined did not explain the sex differences observed. 
Additional characteristics associated with tumor heterogeneity 
included having a first degree relative with cancer in which fewer 
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Table 4. List of published comparative genomic hybridization (CGH) studies of kidney and renal cell carcinoma (RCC) 



Manuscript Sporadic Other tissues Method Factors evaluated Finding 

ccRCC evalauted 
Cases 



Wilhelm et al/ 



Toma et al. 



Arai et al. 



Chen et al. 20 



Beroukhim 

et al.' 9 



Shuib et al. 2 ' 
(data com- 
bined with 
Beroukim 
et al.) 

Moore et al., 
current study 



16 



22 



13 Papillary, 

5 chromophobe, 

6 Oncocytoma, 
2 normal kidney 
22 Matched 
normal tissue 



51 NA 



80 NA 



49, 5 
Metastases 



22 VHL 
wild-type 
(WT) 



412 Clear 
cell RCC 



36 Tumors/12 
VHL patients 



57 VHL patients 



Array CGH- 
329 clones 



Affymetrix 
10k chip 



array CGH 
(4361 BACs) 



lllumina 317K 
SNP array 



Affymetrix 
250K Styl 



Affymetrix 
250K 



Array CGH 
(2464 BACs) 



RCC subtype 
segregation 



Survival 



qPCR, 

immunohistochemistry 
(IHC) 



Survival, grade, stage, 
vascular Involvement 
Promoter methylation 
in 9 genes 

Stage, grade, telomere 
length 

Demographics, 
smoking, 

alcohol, medical Hx 



Von Hippel-Lindau 
(VHL) disease, VHL 
inactivation, grade, 
Gene expression 
differences 



VHL disease 
(germline) 



VHL inactivation, 
age, sex, stage, grade 
Body mass index, 
hypertension, 
family cancer Hx, 
Occupational solvent, 
lead, pesticide, 
exposures 
LINE! methylation 



Segregation of kidney tumor histological sub- 
types by copy number variation (CNV) differences 
ccRCC 3p - , 5q + , and 6q - , 8p - , 
9p- or 14q - 

CNV most frequent at 3p - , 5q + , 1 4q - , 

9-,7q+, 6q- 

Novel microdeletions at Chr1 , Chr 4, Chr6, 
Chr8 and Chr 9 

Some alterations validated using gene and 
immunohistochemical (IHC) protein 
expression assays 

CNV most frequent at 3p - , 5q + , 7 + 
Genetic cluster profiles associated with 
survivial, independent of grade, stage 

CNV most frequent at 3p-, 5q+, 8p12-pter, 

6q23.3-27, 14q24.1-qter, 9q32-qter, 

10q22.3-qter, 9p13.3-pter, 4q28.3-qter, 

1q25.1-qter, 7q21.13-qter, 8q24.12-qter 

Loss of heterozygosity (LOH) 9p, 9q, 14q, 

18q associated with grade 

LOH 14q, 18p, 21 q associated with stage 

LOH 2q, 6p, 6q, 9p, 9q, 1 7p associated 

with short telomere length 

CNV most frequent at 3p - , 1 4q - , 5q + 

CNV higher in VHL WT compared with 

VHL disease cases 

LOH 1 p, 3p (VHL), 4q, 6q, 8p, 9p 

(CDKN2A, 2B) and 14q 

Amplification 1q, 2q, 5q, 7q, 8q (MYC), 

12p and 20q 

CNV most frequent at 3p - , 1 4q - , 5q + 
7p + , 8p- (TNFRSF10C, DUSP4) associated 
with germline VHL clear cell RCC 
17p- (TP53), 2p+ (HIF2A) associated with 
VHL wild-type ccRCC 

CN loss most frequent at 3p, 14q, 8p, 4q, 9p, 

9q, 6q, 3q, lOq, 13q, 1q, 

CN gain at 5q, 7q, 7p, 5p, 20q, 12q, 12p 

Stage and grade associated with 1p-, 9p-, 

9q - 13q - , 14q - , 12q + 

CNV higher among VHL wild-type compared 

to hypermethylated cases 

LOH across 3p varied by VHL inactivation status 

CNV higher in male compared to females 

cases, independent of grade, stage 

CNV higher in cases without positive family Hx 

cancer 

Smoking associated with 16p1 1.2-13.2 gain 



Abbreviations: ccRCC, clear cell renal cell carcinoma; CGH, array comparative genomic hybridization; NA, not applicable. 



copy number alterations were observed compared with those 
without cancer history. Smoking was associated with higher FGA. 
Ever smoking was associated with copy number loss of 3q12-14, 
gain of 16p1 1.2-13.2 for which a significant, positive trend with 
current smoking status was found. 

As previously reported, 5q gain was highly prevalent and was 
not associated with of stage, grade, VHL inactivation suggesting 
that these loci may provide clues to the location of additional 
genes that are also altered in the early stages of ccRCC 
development. Two recent studies that employed SNP arrays 19,20 
also observed gains at the distal end of 5q. Expression analysis 
revealed overexpression of 12 of 22 5q genes in tumor compared 
with normal cortex, including: GNB2L1, MGAT1, RUFY1, RNF130, 
MAPK9, CANX, CNOT6, SQSTM1, LTC4S, TBC1D9B, HNRPH1, and 
F/T4. 19 A second study reported focal amplification of 5q35.3 



containing the SQSTM1 gene that is overexpressed in other 
cancers. 20 The weak correlation between 5q gain and 3p loss 
across studies suggests that new targets, perhaps functioning 
through VHL-HIF independent mechanisms, could be identified 
on 5q. 

This study included a large series of histologically-confirmed 
ccRCC cases with detailed risk factor and patient/clinical informa- 
tion. Our two-step analytical approach, entailing FDR adjustment 
and subsequent multivariate analyses of significant loci, reduced 
possible false-positive reporting. Weaknesses included a low 
sample size for interaction analyses and potential selection bias 
from exclusion of almost 50% of cases. We also used a relatively 
low resolution array compared with some studies. However, the 
current study utilized available high-quality information on each 
patient and RCC risk factors to an extent not previously evaluated 
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in earlier studies. Results from reports that employed higher 
resolution techniques will complement our study findings and 
enable identification of smaller target regions/genes for follow-up. 

In summary, we identified novel associations between patient/ 
clinical characteristics and copy number alterations in a well- 
defined ccRCC case-series. Considerable heterogeneity spanning 
the FHIT gene locus suggests that this fragile site could be critical 
in terms of chromosomal loss of regions distal to 3p14.2 in cases 
with VHL promoter hypermethylation. Our findings also suggest 
that male tumors may be less genetically stable compared with 
those in female cases, and may benefit from more aggressive 
follow-up. Lastly, this study provides evidence that loci on 5q may 
provide clues to RCC carcinogenesis that could be distinct to the 
VHL-HIF pathway. 



MATERIALS AND METHODS 

Study population 

Incident cases (ICD-02 code C64) were participants in a case-control study 
of RCC conducted in seven centers in four Central and Eastern European 
countries (Moscow, Russia; Bucharest, Romania; Lodz, Poland; and Prague, 
Olomouc, Ceske-Budejovice, and Brno, Czech Republic). 14,15 Diagnostic 
slides were re-reviewed by an expert renal cancer pathologist for 
standardized classification (MM). Of 1097 cases, 763 were ccRCC, and 
frozen biopsies were available from 412 (54%) cases. Protocols were 
approved by ethics committees and institutional review boards of 
participating centers, the International Agency for Research on Cancer 
and the US National Cancer Institute. All patients and physicians provided 
written informed consent. Patients were asked about lifestyle habits, 
height/weight one year before diagnosis, personal/family medical history 
and diet. 13 " 16 

Array comparative genomic hybridization 

Areas of frozen biopsies containing ^70% tumor cells were macro- 
dissected and non-tumor tissue was removed. 13 A standard protocol 
(http://cc.ucsf.edu/people/waldman/Protocols) was used to extract DNA. 
Arrays included 2464 BACs at approximately Mb intervals along the 
genome, printed in triplicate. 22,26 DNA samples (500 ng) were labeled as 
described previously. 27 

VHL 

Inactivating alterations in exons 1-3 of the VHL gene and promoter 
hypermethylation were assessed as described previously. 13 Wild-type 
tumors included those without sequence alterations affecting the protein 
coding region, splice junctions or promoter hypermethylation. 

Data processing 

Data were processed using SPROC to filter data points with low DAPI 
intensity, reference to DAPI signal intensity and correlation between Cy3 
and Cy5 within spots. Ratios that were derived from a single clone or those 
with a triplicate log2 SD >0.2 were considered missing. Samples were 
hybridized on two versions of human array (3.1, 3.2). Clone data were 
combined sharing data from both arrays. Post-processing, 2111 clones 
remained. Data were mapped to the human DNA sequence May 2004 freeze. 

Data were segmented using circular binary segmentation, 28 
implemented in the DNAcopy package from Bioconductor 29 to translate 
intensity measurements into equal copy number regions. Median absolute 
deviation, scaled by the factor 1.4826 of the difference between the 
observed (log2 ratio) and segment values of autosomal clones, were used 
to estimate sample-specific experimental variation. Clone status outcome 
assignment (gain/loss/normal) was conducted, applying the merge levels 
procedure to segment values 30 To identify single technical or biological 
outliers (that is, high-level amplifications), their presence was allowed 
within each segment. Outlier clones included those for with an observed 
value at least four sample-specific median absolute deviation from the 
segment. High-level amplifications included narrow and single outliers 
with high copy numbers, as described previously 31 

To quantify the total amount of genome altered, each clone was 
assigned a genomic distance equal to the sum of one half the distance 
between its center and that of neighboring clones or to the chromosome 
end for probes with one neighbor 31 FGA was defined as the proportion of 



the distance of the autosomal clone regions altered to that of the entire 
genome. FGG or FGL, considered only distances that were gained or lost. 

Clone-wise association tests between copy number alterations and 
clinical variables were based on the segment values with observed values 
for outlier clones. Moderated f-tests for dichotomous variables and 
moderated F-statistics 32 were used for more than two groups. P-values 
were adjusted for multiple testing by controlling the FDR. 32,33 Wilcoxon's 
rank-sum tests were performed to evaluate differential associations of 
summary genomic events (that is, FGA, FGG, FGL, amplifications and 
specific alterations) with dichotomous variables. Kruskal-Wallis rank-sum 
tests were used for events having more than two groups. Statistical 
analyses of autosomes were performed using R/Bioconductor 29,34 P-values 
<0.05 were considered significant 33 

Copy number alterations 

The prevalence of ccRCC cases with/without frozen biopsies were 
compared by patient characteristics using j 1 and Fisher's Exact tests. 
Alterations of chromosome arms were coded as '0' (no alterations) and T 
(dominant change), defined as gain or loss in >25% of cases. Also 
included were those previously reported as important alterations in renal 
cancer. To compare cases with/without p or q arm alterations, cases with 
gains/losses were considered as dependent variables in stepwise logistic 
regression models with patient/clinical characteristics, lifestyle and 
occupational exposure variables. Criteria for inclusion into multivariate 
models was a P-value <0.20. Study design matching variables (country, 
sex, age) were included in all models. All variables were fitted in logistic 
models to obtain odds ratios (OR) and 95% confidence intervals (95% CI) as 
risk estimates of each alteration. Individual clone copy number differences 
between groups were initially tested using a moderated f-statistic or 
F-statistic with pooled variance, adjusted for multiple comparisons 
controlling the FDR. 32 Clones identified through the first screen, 
observed in >20% of cases, were included in multivariate analyses 
conducted using SAS 9.1.3 (SAS Institute Inc.) and STATA 10.0 software. 
Clone-wise tests were conducted in R, 34 all were two-sided. 
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